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Feature extraction So that from the extraction of leaf venation features 19 features were obtained. 
Leaf venation Identification of plant species was carried out using Support Vector Machine 
SVM (SVM) with RBF kernel. The learning model was built using 75% of 


the training data. The testing results using 25% of the data on the training 
model, obtained an accuracy of 82.67%, with an average of precision of 84% 
and recall of 83%. 
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1. INTRODUCTION 

Many studies that discuss techniques or methods for identifying plants, show that plant identification 
is important. The study of the physical form and structure of plant bodies, known as plant morphology [1]. 
Plant morphology is useful for visually identifying plants, so that plant diversity can be identified, classified 
and given the right name for each group formed. In addition to describing the shape and composition of 
the plant body, plant morphology is useful for determining the function of each part in plant life. Furthermore, 
the origin and composition of the plant body can be known [2]. Information about plant morphology is needed 
to understand evolution, ecology, geographical distribution, conservation, life cycle, and species definition [3]. 

Parts of plants that have different characteristics from each plant and commonly used for plant 
identification are leaves [4-5]. Leaves have an important role for plants to adapt to their environment [6]. 
The main features on leaves that can distinguish each plant species include shape [7-9], texture [10-13], 
color [14], and leaf venation [15-18] are called leaf biometrics [19]. Among those features, that have unique 
diversity and can describe plant characteristics in more detail is leaf venation, although there are some species 
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of plants that have vague patterns that are not very clear [20]. To obtain leaf venation patterns, it can be done 
using feature extraction techniques through digital image processing [21-22], so that leaf venation features 
are obtained. 

Extraction of leaf venation feature can be done by implementing Self-Invariant Feature Transform 
(SIFT) [23], Fourier and B-Spline modeling [15]. However, the results of segmentation are less than optimal 
because they only reach secondary venation. In addition, many parts of leaf venation are not segmented. 
The method is then improved using the Hessian matrix [24], which is by implementing a vessel measure based 
on the eigenvalues of the Hessian matrix. The result is that the system can segment leaf venation to tertiary 
venation. In Prastya's study [25], the binary venation image of the extracted leaves was calculated for the value 
of straightness, a different angle, length ratio, and scale projection. These values are then used as a marker 
of leaf venation. Other studies conducted by Ambarwari et al. [16], who performed an analysis of leaf venation 
density features to obtain the most important features, which can distinguish types of leaf venation. These 
features by Ambarwari et al. [26] is used to identify plants based on the type of venation. However, from some 
of these studies, no one has identified the plant species. 

In this study, identifying plant species using the leaf venation feature. Leaf venation features were 
obtained through feature extraction. Leaf venation feature extraction will produce several features, 
including straightness, a different angle, length ratio, scale projection [25], total skeleton length, number of 
branching points and number of ending points [27]. This leaf venation feature was used to identify plant species 
by classification techniques. The classification technique used is the Support Vector Machine (SVM). 
In many cases such as pattern recognition, SVM error rates when testing data are significantly 
better than other methods [28]. 


2. RESEARCH METHOD 
2.1. Leaf image acquisition 

The stages in this study consisted of six stages as shown in Figure 1. Leaf image data used are Flavia 
dataset [29] with 6 species were taken, namely Aesculus chinensis, Lagerstroemia indica (L.) Pers., 
Cinnamomum japonicum Sieb., Chimonanthus praecox L., Ilex macrocarpa Oliv., and Koelreuteria paniculata 
Laxm. The number of leaves in each species is 50 leaf images. Leaf image samples are shown in Figure 2. 
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Figure 1. Stages of research 
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Figure 2. Leaf image data, (a) Aesculus chinensis, (b) Lagerstroemia indica (L.) Pers, (c) Cinnamomum 
japonicum Sieb, (d) Chimonanthus praecox L, (e) Ilex macrocarpa Oliv, (f) Koelreuteria paniculata laxm 


2.2. Segmentation 

Image segmentation is the process of separating images into homogeneous parts and extracting these 
parts into objects that will be observed so that the region of interest 1s obtained [30]. From the acquisition of 
leaf image data, then image segmentation was performed using the Hessian matrix [24] to obtain the leaf 
venation shape. Leaf image data from segmentation results in the form of leaf venation binary image data 
shown in Figure 3. 
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Figure 3. Binary venation leaf image data, (a) Aesculus chinensis, (b) Lagerstroemia indica (L.) Pers., (c) 
Cinnamomum japonicum Sieb, (d) Chimonanthus praecox L, (e) Ilex macrocarpa Oliv, (f) Koelreuteria 
paniculata laxm 


2.3. Extraction of leaf venation feature 
The leaf venation image data from the segmentation results, then exraction of the leaf venation feature. 


Leaf venation feature extraction was obtained from the calculation of the value of straightness, different angle, 
length ratio and scale projection [25], in order to calculate these values first, the detection of branch points and 
end points. Illustration of leaf venation feature extraction is shown in Figure 4. 
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Figure 4. Illustration of leaf venation feature extraction [16] 


The notation of Figure 4 is presented below: 
x, y = pixel coordinate 
l; |= length of the j segment that represented by the number of the pixels 
dj; = distance between pixel coordinates (Xs, ys) and (Xe, Ye) 
Straightness is a measurement of the alignment value of a segment. From Figure 4, the straightness 


value is calculated using (1). 


— 


straightness = i (1) 
j 


Different angle (6,) is a measurement of the angle difference between coincident segments. From Figure 4, 
different angle values are calculated using the (2). 


5, = |0;- «| (2) 
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Length ratio (R;) is measured by comparing the length of each segment with the maximum length of 
the segment in a leaf venation image. From Figure 4, the length ratio value is calculated using (3) 


l 


i 
max(I) 


Scale Projection (P;;) 1s a measurement of projection length between coincident segments. In Figure 4, 
the projected length between segments 1 and j is calculated using (4) and (5). 





R; = (3) 
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This leaf venation feature extraction produces several features, including straightness, different angle, length 
ratio, scale projection, skeleton length, segment, total skeleton, number of branching points and number of 
ending points. 


2.4. SVM classification 

The model was built using the SVM classifier with two kernel functions, the RBF and Linear kernels. 
In the RBF kernel function, parameters C and gamma (y). C is a parameter to determine the amount of penalty 
due to an error in the classification of training data, while y is a parameter controlling the width of the Gaussian 
function variant. The parameter C value that was tested was [1, 10, 100, 1000] and the parameter value y that 
was tested was [le-1, le-2, le-3, le-4]. The search for the best parameter values is done using a grid search 
with 5-fold cross-validation in the training data. The best pair of C and y values are obtained based on 
the greatest accuracy value at the time of training the data. The stages of the SVM classification process are 


shown in Figure 5. 
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Figure 5. Stages of the SVM classification process 
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3. RESULTS AND ANALYSIS 

The Flavia dataset [28] used in this method was chosen based on the extraction of the leaf venation 
feature. The Flavia dataset consists of 32 species, there are some species which the leaf venation features cannot 
be extracted. This is because the pattern of leaf venation was not clearly visible, so only leaf venation that has 
a clear pattern was chosen. Then from this dataset, extraction of the leaf venation feature was performed. Data 
resulting from the extraction of leaf venation feature consist of 19 features, including mean, variance, standard 
deviation (of straightness, a different angle, length ratio, scale projection, and length), total skeleton, endpoint, 
branch point, and segment. The value of these features has a different range of values so that it is normalized 
using Min-Max normalization. The data is then divided into training data and testing data with 75% portion of 
training data (225 data) and 25% of testing data (75 data). 

In the training data, the best parameter search was performed to build a training model by applying 
a grid search with 5-fold cross-validation. The best parameter search results are done by pairing the parameters 
that have been prepared. The best parameters are determined based on the highest accuracy. The parameter 
search results using the grid search are shown in Table 1. 

Based on Table 1 obtained the highest accuracy of 77.8% using the RBF kernel. The best parameters 
were selected based on the accuracy, that is the parameter values C = 1000 and gamma (y) = 0.1. 
This combination of kernel and parameters was used to build the SVM learning model. Then the results of tests 
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conducted on the model that has been built are displayed in the form of a confusion matrix. The results of 
the confusion matrix on the testing data against the SVM learning model are shown in Figure 6. 


Table 1. Search results of parameters using the grid search 


Kernel C Gamma Accurac 
rbf 1 0.1 0.600 
rbf 1 0.01 0.413 
rbf 1 0.001 0.413 
rbf 1 0.0001 0.413 
rbf 10 0.1 0.742 
rbf 10 0.01 0.596 
rbf 10 0.001 0.413 
rbf 10 0.0001 0.413 
rbf 100 0.1 0.751 
rbf 100 0.01 0.738 
rbf 100 0.001 0.600 
rbf 100 0.0001 0.413 
rbf 1000 0.1 0.778 
rbf 1000 0.01 0.751 
rbf 1000 0.001 0.738 
rbf 1000 0.0001 0.600 

linear 1 - 0.711 
linear 10 - 0.764 
linear 100 - 0.760 
linear 1000 - 0.764 
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Figure 6. Confusion matrix 


Based on the confusion matrix Figure 6 it can be seen that of the six species tested only Koelreuteria 
paniculata Laxm. species, where all the data is correctly classified. Whereas the other species on average have 
misclassified between 2 to 3 misclassified data. Overall, the average accuracy obtained in testing data is 
82.67%. This accuracy value is far better than identifying leaves based on the type of leaf venation done by 
Ambarwari et al. [16]. In the research of Ambarwari et al. [16] the highest accuracy was obtained at 77.57%. 
One reason for the small accuracy is the amount of data that is not the same in each category. In addition, 
the type of leaf venation consists of several different species, so it is possible have different features of leaf 
venation. Detailed results of the SVM classification are shown in Table 2. 

Besides accuracy, to see the performance of a classifier we need precision and recall values. Based on 
Table 2 it can be seen that the species of Aesculus chinensis and Lagerstroemia indica (L.) Pers. which has 
precision and recall values below 80%. This means that the data of the two species affect the decreases of 
accuracy. Even so, the average precision and recall obtained were quite high, namely 84% for the value of 
precision and 83% for the value of recall. This result is very small compared to the study of Wang et al. [29], 
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who used the leaf shape feature for plant identification. However, the leaf venation feature can be used to obtain 
information related to plant anatomy such as the location of plants growt and environmental conditions [16]. 


Table 2. Detailed results of the SVM classification 


Species Precision Recall Fl-score Support 

Aesculus chinensis 0.60 0.75 0.67 12 
Lagerstroemia indica (L.) Pers. 0.71 0.77 0.74 13 
Cinnamomum japonicum Sieb. 0.92 0.85 0.88 13 
Chimonanthus praecox L. 0.92 0.92 0.92 12 
Ilex macrocarpa Oliv. 0.90 0.69 0.78 13 
Koelreuteria paniculata Laxm. 1.00 1.00 1.00 12 

Average / total 0.84 0.83 0.83 75 


4. CONCLUSION 

Identification of plant species based on leaf venation features was carried out using the SVM classifier. 
The application of RBF kernel with parameters C and gamma (y) in SVM, is able to classify plants with 
an accuracy of 82.67%. In addition to accuracy, performance on SVM is also measured by an average of 
precision and recall which is 84% for precision and 83% for recall. The use of leaf venation as a feature in 
the identification of plant species can be used as an alternative when the leaves of plants have similar shapes 
and textures. As for suggestions on studies related to plant identification, the process of identification using 
the feature of leaf venation requires quite a long time, especially at the segmentation stage. The duration of leaf 
segmentation 1s due to the manually determined threshold value, meaning that each leaf threshold value is 
different. On the other hand, information regarding leaf venation features can be obtained. In addition, if 
the focus of research is on identifying and not caring about plant conditions, another method that can be applied 
for plant identification is the convolutional neural network (CNN). In terms of speed is superior, but 
the specifications of the devices used are also higher. 
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